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OBJECTIVE :  To  develop  a  model,  called  Geocore,  for  protein 
and  polymer  folding. 

APPROACH:  The  model  is  united  atom,  with  3-6  discrete  phi  psi 
backbone  options  per  amino  acid  and  sidechains  are  fixed  in 
their  predominant  rotameric  state,  chosen  according  to  the 
relative  populations  in  the  Protein  Databank.  There  is  one 
hydrophobic ity  and  one  H  bond  parameter,  steric  violations 
are  not  allowed,  and  the  conformations  are  searched  by  an 
extensive  branch- and-bound  method. 

ACCOMPLISHMENTS :  We  used  lattice  models  first  to  explore 
principles  of  fast  conformational  search  processes  and  simple 
energy  functions.  We  developed  theory  for  inverse  folding, 
on  sequence  design,  that  showed  that  this  problem  is  in  a 
much  simpler  class  of  computational  difficulty  than  protein 
folding.  We  developed  a  very  fast  lattice-based  search 
methodology  that  finds  globally  optimal  conformations  of  the 
longest  chains  yet  found  by  any  method. 

We  then  incorporated  some  of  the  same  conformational  search 
principles  into  an  off-lattice  method,  called  Geocore.  We 
have  tested  Geocore  now  on  18  peptides  -  all  the  peptides 
having  known  structure  (i.e.,  in  the  PDB)  shorter  than  31 
amino  acids  (the  limit  of  the  current  search  method) .  Eleven 
of  these  peptides  are  water  soluble.  Geocore  does  well  as  a 
filtering  algorithm,  meaning  that  keeping  the  400  lowest- 
energy  configurations  out  of  more  than  109  explored  per 
typical  peptide,  is  sufficient  to  retain  a  conformation  that 
is  native-like  by  the  criteria  that  the  secondary  structures 
are  similar  and  RMSD  errors  are  less  than  about  5  angstroms, 
often  about  3  angstroms. 

The  limitations  of  the  method  are  in  the  limited  phi  psi 
options  and  the  slow  search  speeds.  We  currently  have 
independent  funding  to  pursue  improvements  in  these  areas . 

CONCLUSIONS :  We  have  found  that  branch- and-bound  search 

methods  are  competitive  in  speed  with  standard  methods,  like 
Monte  Carlo,  for  models  having  discrete  phi  psi  options.  The 
advantage  of  the  Geocore  search  is  that  we  can  be  assured  of 
global  minima,  and  this  gives  a  deep  test  of  the  energy 
function.  A  main  conclusion  from  this  work  is  that  simple 


energy  functions,  with  few  parameters,  are  arguably  no  worse 
than  more  complex  ones .  We  believe  Geocore  is  a  viable 
early-stage  peptide  folding  model. 

SIGNIFICANCE :  This  work  takes  steps  toward  a  computer 
algorithm  that  can  compute  the  structures  of  peptides  and 
proteins,  ab  initio,  from  their  monomer  sequences. 
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